<HTML>

<HEAD>
<META http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> 
<TITLE>Extractor: Release History</TITLE>
</HEAD>

<BODY BGCOLOR="#FFFFFF">

<MAP NAME="banner_top">
<AREA SHAPE="rect" COORDS="588,14,620,40" 
    HREF="http://www.iit.nrc.ca/english.html">
<AREA SHAPE="rect" COORDS="538,14,583,37" 
    HREF="http://www.nrc.ca/corporate/english/">
<AREA SHAPE="rect" COORDS="86,4,421,37" 
    HREF="http://www.iit.nrc.ca/II_public/index.html">
</MAP>

<IMG SRC="banner_top.jpg" width="620" height="37" 
    alt="II Group Banner" USEMAP="#banner_top"
ISMAP border="0"><BR><IMG SRC="banner_extractor.jpg" width="217" 
    heigth="49" alt="Extractor">

<H1><FONT COLOR="#400080">Extractor: Release History</FONT></H1>

<SMALL><I>Extractor 7.2, Revised December 4, 2001</I></SMALL><BR>
<SMALL><I>Copyright &copy; 2001, National Research Council of Canada</I></SMALL>

<HR>

<TABLE WIDTH="100%">

<TR BGCOLOR="CCCCCC">
<TD WIDTH=100><B>Version Number</B></TD>
<TD WIDTH=150><B>Release Date</B></TD>
<TD><B>Description of Changes</B></TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 7.2</TD>
<TD>December 7, 2001</TD>
<TD>
<UL>
<LI>
improvements to keyphrases and highlights in English, French, German, and
Spanish</LI>
<LI>
increased error checking, to warn of incorrect usage of the API</LI>
<LI>
increased support for special punctuation characters</LI>
<LI>
improvements to wrappers for calling Extractor from Java, Perl, and
Python, in Windows, Linux, and Solaris
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 7.1</TD>
<TD>June 1, 2001</TD>
<TD>
<UL>
<LI>
new API function for deactivating the plain text filter (useful for writing
custom filters)</LI>
<LI>
changes to the source code to facilitate the use of customized memory routines</LI>
<LI>
improved handling of hyphens</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 7.0</TD>
<TD>November 27, 2000</TD>
<TD>
<UL>
<LI>
handles English, French, Japanese, German, Spanish, and <B>Korean</B> text</LI>
<LI>
choice of three Korean character encodings: EUC-KR (KS C 5601-1987),
Johap (KS X 1001:1992 alternate encoding), and Unicode UCS-2 (double-byte 
character code, using native byte ordering)</LI>
<LI>
new <B>go phrase</B> feature allows user to specify important words and phrases</LI>
<LI>
improvements to highlights in all languages</LI>
<LI>
improvements to keyphrases in German</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEEEFF">
<TD>Extractor 6.1</TD>
<TD>September 7, 2000</TD>
<TD>
<UL>
<LI>
improvements to Japanese highlights</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEEEFF">
<TD>Extractor 6.0</TD>
<TD>July 17, 2000</TD>
<TD>
<UL>
<LI>
handles English, French, Japanese, German, and <B>Spanish</B> text</LI>
<LI>
new API function for finding how many words were read</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="FFEEEE">
<TD>Extractor 5.1</TD>
<TD>January 21, 2000</TD>
<TD>
<UL>
<LI>
extracts <B>key sentences</B> (highlights) in addition to keyphrases</LI>
<LI>
important phrases inside highlights can be automatically marked bold</LI>
<LI>
unimportant words inside highlights can be automatically marked grey</LI>
<LI>
improved filtering of e-mail; attachments processed according to MIME type
<LI>
improved filtering of HTML</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="FFEEEE">
<TD>Extractor 5.0</TD>
<TD>July 6, 1999</TD>
<TD>
<UL>
<LI>
handles English, French, Japanese, and <B>German</B> text</LI>
<LI>
improved filtering of HTML</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 4.1</TD>
<TD>May 6, 1999</TD>
<TD>
<UL>
<LI>
improved keyphrases</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 4.0</TD>
<TD>March 22, 1999</TD>
<TD>
<UL>
<LI>
handles English, French, and <B>Japanese</B> text</LI>
<LI>
choice of four Japanese character encodings: JIS, Shift-JIS, EUC-JP, 
and Unicode UCS-2 (double-byte character code, using native byte 
ordering)</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEEEFF">
<TD>Extractor 3.3</TD>
<TD>February 1, 1999</TD>
<TD>
<UL>
<LI>
quality of the keyphrases has been further improved, especially
for French text</LI>
<LI>
some improvements to handling of HTML</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEEEFF">
<TD>Extractor 3.2</TD>
<TD>December 14, 1998</TD>
<TD>
<UL>
<LI>
user may now request from 3 to 30 keyphrases (previously 5 to 15)</LI>
<LI>
quality of the keyphrases has been further improved</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEEEFF">
<TD>Extractor 3.1</TD>
<TD>September 18, 1998</TD>
<TD>
<UL>
<LI>
<B>fully reentrant</B>, to allow 
<A HREF="threads.html">
multithreading</A> without the use of 
Win32 services such as semaphores and the EnterCriticalSection and 
LeaveCriticalSection functions</LI>
<LI>
added arguments to ExtrAddStopWord and ExtrAddStopPhrase, to
specify character code</LI>
<LI>
added support of Unicode UCS2 double-byte character codes,
using native byte ordering</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEEEFF">
<TD>Extractor 3.0</TD>
<TD>April 30, 1998</TD>
<TD>
<UL>
<LI>
handles both <B>French</B> and English&nbsp;</LI>
<LI>
new API functions for French / English language options</LI>
<LI>
better API method for specifying desired number of keyphrases</LI>
<LI>
e-mail filter handles MIME quoted-printable accents</LI>
<LI>
HTML filter handles HTML escape sequences for accents and ISO Latin-1 HTML
character entities</LI>
<LI>
handles both ISO Latin-1 and MS-DOS Code Page 437 character codes&nbsp;</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="FFEEEE">
<TD>Extractor 2.0</TD>
<TD>January 27, 1998</TD>
<TD>
<UL>
<LI>
new API function for controlling number of phrases</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.7</TD>
<TD>December 19, 1997</TD>
<TD>
<UL>
<LI>
new API function for finding numerical score of keyphrase</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.6</TD>
<TD>November 26, 1997</TD>
<TD>
<UL>
<LI>
improved documentation</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.5</TD>
<TD>September 11, 1997</TD>
<TD>
<UL>
<LI>
improved keyphrases</LI>
<LI>
better filtering of HTML</LI>
<LI>
better filtering of e-mail</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.4</TD>
<TD>July 16, 1997</TD>
<TD>
<UL>
<LI>
first version with 
<A HREF="api.html">
API</A> and <B>DLL</B></LI>
<LI>
can be embedded in other software</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.3</TD>
<TD>June 11, 1997</TD>

<TD>
<UL>
<LI>
improved keyphrases</LI>
<LI>
better filtering of HTML</LI>
<LI>
better filtering of e-mail</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.2</TD>
<TD>April 16, 1997</TD>
<TD>
<UL>
<LI>
improved keyphrases</LI>
<LI>
more stop words</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.1</TD>
<TD>January 17, 1997</TD>
<TD>
<UL>
<LI>
improved interface</LI>
<LI>
simplified output</LI>
</UL>
</TD>
</TR>

<TR BGCOLOR="EEFFEE">
<TD>Extractor 1.0</TD>
<TD>January 9, 1997</TD>
<TD>
<UL>
<LI>
first release of Extractor</LI>
<LI>
<B>English</B> only</LI>
</UL>
</TD>
</TR>
</TABLE>

<BR>

<P>

<HR>

<CENTER>
<table border="1" bgcolor="#ccccff">
<tr><td><font size=2>
[ <a href="http://extractor.iit.nrc.ca/">Extractor Home</a> |
<A HREF="http://www.iit.nrc.ca/II_public/french.html">Fran&ccedil;ais</a> |
<A HREF="http://www.iit.nrc.ca/english.html">IIT</A> |
<A HREF="http://www.iit.nrc.ca/II_public/index.html">II Group</A> |
<A HREF="http://www.nrc.ca/corporate/english/">NRC</A> |
<A HREF="http://ai.iit.nrc.ca/search.html">Search</A> |
<A HREF="mailto:Peter.Turney@nrc.ca">Feedback</A> ]
[ <I>Updated</I>: December 4, 2001 ]</font size=2></td></tr>
</table>
</CENTER>

</BODY>
</HTML>



